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1.  Introduction 


The  recursive  nature  of  logic  programs  has  long  been  the  subject  of  optimisation  techniques 
(2}[7).  Recently,  the  database  community  has  taken  interest  in  extending  the  expressive  power  of 
relational  algebra  by  augmenting  it  with  function-free  Horn  style  logic  queries.  This  extension 
has  led  to  various  optimisation  techniques[2}[7).  It  seems,  almost  invariably,  these  techniques  are 
most  efficient  in  the  processing  of  linear  recursive  queries.  For  this  reason,  there  is  a  genuine 
interest  in  identifying  those  classes  of  non-linear  recursive  queries  which  can  be  rewritten  as  linear 
queries.  Among  these  classes  are  binary  chained  purely  exponential  queries[4]  and  doubly  recursive 
queties[9]. 

In  this  paper,  we  provide  a  sufficient  condition  for  a  subclass  of  purely  exponential  queries 
to  be  equivalent  to  linear  queries.  This  subclass  properly  contains  the  class  of  binary  chained 
purely  exponential  queries.  In  addition,  as  a  by  product  of  this  work  we  construct  a  very  natural 
progressively  weaker  infinite  chain  of  template  dependencies 3). 

t.  Preliminariei 

A  literal  is  an  expression  of  the  form  P[X j,  ....  Xm),  where  P  is  a  predicate  symbol  of 

arity  m  and  X|’s  ate  either  variables  or  constants.  A  role  is  a  formula  of  the  form 


p(x„*,,...,xm) : - n,r . n.m), 


n,t . n,m) . <?»(*», i,  Yn,» . y»,m). 


where  P(Xx,X j, ...,Xm)  and  Yi,i . Fi.m )  for  i  =  1,2,...,»  are  literals.  A  rule 

is  recursive  if  P  =  Qx  for  some  «.  A  recursive  rule  is  linear  if  there  is  exactly  one  occur¬ 
rence  of  P  on  the  right-hand  side.  We  will  call  P(X x,Xt,...,Xm)  the  head  of  the  rule  and 

. . y*,™) _ ,y»,m)  the  body  of  the  «ie. 

The  variables  appearing  in  the  head  of  the  rule  are  called  distinguished  variables,  and  all  other 
variables  are  called  nondistinguished.  We  will  also  make  the  assumption  that  all  predicates  except 
P  are  denoting  base  relations  (i.e.,  relations  explicitly  stored  in  the  database). 


Intuitively,  a  rule  states  that  the  tuple  (Xx,  Xx, . . . ,  X„)  is  in  P,  if  tuples 
(^1.1 »  *  •  •  i  m)» (I'm. I'm. * •  • . •  ••.  and  (Vn,i .  1b,j. •  •  •.  Fn,m)  are  in  QitQ},...  and  Q* 
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respectively.  A  logic  program  is  a  finite  sequence  of  rules  to  be  interpreted  as  a  finite  disjunct  of 
rules. 

Example  2.1  Let  R  denote  a  binary  base  relation,  then  the  following  program  represents 
the  transitive  closure  of  R: 


(?i  :  T(X,Y)  :-T(X,Z),  T(Z,Y). 

(/)  < 

{  r ,  :  T(X,Y )  :  -  R(X,  Y). 

Rule  states  that  every  tuple  in  R  is  also  in  7,  while  rule  ?!  states  that  all  other  tuples  in  T 
should  be  obtained  by  the  composition  of  tnples  in  7. 

Although  logic  programs  in  general  are  evaluated  via  resolution  methods,  logic  queries  in 
database  settings  are  evaluated  by  fixed  point  techniques  due  to  the  restricted  form  of  these 
queries.  We  will  demonstrate  this  technique  using  the  program  given  in  example  2.1.  Assume 
that  the  base  relation  representing  R  is  {(o,  6),  (6,  d),(c,f)}. 

Stepl:  7=1  ,*  =  {(«,  6),  (h,c),(c,/)}. 

Step  S:  Place  the  current  values  of  T  and  R  from  step  1  into  the  bodies  of  rules  rt  and  ?!. 
As  the  current  value  of  T  is  9,  rule  ?i  will  not  produce  any  tuple,  while  rule  ?i  will  add  the  current 
value  of  R  to  7,  i.e., 

7  =  {{a,  b),  (b,  c),  (c,  /)}, 

R  =  {(a,h),(6,c),(c,/)}. 

Step  3:  Place  the  current  values  of  7  and  R  from  step  2  into  the  bodies  of  rules  ?i  and  ?i- 
Rule  ?!  will  not  add  any  new  tuples  to  7.  Rule  ?j  will  add  tuples  (a,  c)  and  (6,  /)  to  7,  as  these 
two  tuples  are  the  result  of  taking  join  over  the  attribute  Z  and  then  projecting  over  the  attributes 
X  and  Y.  Hence, 

7  =  {(o,  b),  (b,  c),  (c,  /),  (o,  c),  (6,  /)}, 

R={(n,6),(6,c),(c,/)}. 

Step  4:  Place  the  current  values  of  7  and  R  from  step  3  into  the  bodies  of  rules  ?t  and  ?s. 
Again  rule  ?t  will  not  add  any  new  tuples  to  7,  while  rule  ?i  will  add  the  tuple  (a,  /)  to  7.  Hence, 
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T  ={(«,  *),  (k,  c),  (c,  /).  (a,  c),  (k,  /),  («,  /)}, 
A={(o,k),(k,c),(c,/)>. 


5<ep  5;  Place  the  current  mines  of  T  and  R  from  step  4  into  the  bodies  of  rales  ri  and  r*. 
At  this  time,  neither  rale  produces  any  new  tuples.  The  procedure  terminates  and  the  transitive 
closure  of  A  is  taken  to  be  the  last  value  of  T  from  step  4. 

The  above  procedure  will  always  terminate  due  to  the  existence  of  the  least  fixed  point[l]. 
The  pa  rely  exponential  programs  are  defined  to  be  programs  of  the  form{4]: 


S[Xt,Xt . Xm) 

S(X,,  Xm) 


s(y,.„yM . y«.m),  •,s(yw,„yn,„ 

A(XUX* . xm). 


(*) 


The  class  of  purely  exponential  programs  contains  a  large  number  of  natural  examples  such 
as  transitive  closure  and  Cartesian  products.  Moreover,  the  recursive  rules  in  purely  exponen¬ 
tial  programs  are  essentially  template  dependencies  as  defined  in  [S][8].  We  say  that  a  purely 
exponential  program  is  kinary  chained  if  it  has  the  following  form: 


S(X,y)  S(X,Zi),  SiZuZj),  S(Zn-i,Y). 

Spr.y)  A(x,y). 

Two  programs  P\  and  Pj  defining  predicates  Si  and  5j  using  the  same  set  of  base  relations  are 
equivalent  if  both  Pi  and  Pj  produce  the  same  relation  for  Si  and  Sj  for  all  values  of  the  base 
relations.  For  example,  the  program  given  in  example  2.1  is  equivalent  to  the  following  program: 

:  T(X,Y)  :-  R{X,Z),  T(Z,Y). 

:  T(X,Y)  :-  A(X.y). 

One  approach  in  establishing  equivalence  is  to  expand  the  recursive  predicates  Si  and  Sj  into 
disjunct  of  conjunctions  of  base  predicates.  Since  programs  such  as  transitive  closure  are  not  first 
order  properties  [1],  in  general  the  disjunct  is  infinite.  The  following  infinite  sequence  defines  the 
transitive  closure  of  a  relation  R  (commas  are  to  be  interpreted  as  and): 
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R(X,Y) 

R(X,Z0),r(Zo,Y ) 
R(X,Zo),R(Z0,Zt)lR(ZltY) 


R(X,  Z0),  R(Z0,  Zi),  R(Zi,Zj),  R{Z ,,  Y) 


The  first  expression  is  obtained  from  rule  of  program  (//),  the  second  expression  is  obtained 
from  rule  rj  of  program  (//)  with  the  nondistinguished  variable  Z  being  renamed  Z0  and  T(Z,  Y) 
rewritten  as  R(Z,Y),  the  third  expression  is  obtained  from  rule  r\  by  rewriting  T(Zo,Y)  as 
R(Z0,Zi),T{Zi,Y)  using  rule  ri  recursively  and  then  rewriting  T(ZUY)  as  R{Z\,Y)  using  rule 
r*],  and  so  on.  It  is  important  that  we  rename  the  nondistinguished  variables  as  we  expand.  Intu¬ 
itively,  the  first  expression  represents  the  base  relation  R,  the  second  represents  all  tuples  obtained 
t tom  R  via  one  application  of  transitivity,  the  third  represents  all  tuples  obtained  from  R  via  two 
applications  of  transitivity,  and  so  on.  Then  the  transitive  closure  is  defined  to  be  the  union  of 
all  relations  defined  by  these  expressions.  If  there  is  more  than  one  occurrence  of  the  recursive 
predicate,  we  must  systematically  expand  all  occurrences  of  the  predicate  by  means  of  a  selector 
function[6].  In  the  terminology  of  first  order  logic  the  above  infinite  sequence  can  be  written  as: 


{xy  |  *(x,y)} 

{XY  I  &Zo)(RiX,Zo)/\R(Zo>Y))} 

{XY  |  (3Zo)(3Zl)(R(X,Z0)AR(Zo,Z1)AR(ZllY))} 

{XY  |  (3Zo){3Zl){3ZJ){R(X,Zo)AR{Zo,Zl)AR(Zl,Zi)AR(Zi,Y))} 


Finally,  we  call  a  mapping  p  between  variables  of  expressions  e(  and  e*  a  containment  map, 
if  p  maps  each  distinguished  variable  to  itself  and  for  every  literal  P(Xt ,  Xj, . . . ,  Xm)  of  elt 
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then  P(p(JTt),p(Jfi),...lp(JrB,))  is  n  literal  of  e,.  The  next  lemma  states  the  relationship 
between  a  containment  map  and  relation*  defined  by  expressions  et  and  ej. 


Lemma  2.1  If  p  is  a  containment  map  from  e\  to  ej,  then  the  relation  defined  by  e3  is  a 
subset  of  the  relation  defined  by  ei. 

9.  Main  Remit 

In  this  section,  we  will  establish  a  sufficient  condition  to  rewrite  purely  exponential  queries  of 
the  form  (*)  into  the  following  linear  queries: 


S(Xi, ■ . . ,  Xm)  :  -  A(YX%1,  K,,j . n.m),  •  -  •.  MY(n-i), i.  Y(n. ....  ' 

s(n.r,y»,j . n,m).  (•*) 

S(*i,*» . Xm)  A(X,,*, . Xm). 

In  order  to  motivate  the  readers,  we  first  provide  an  example  of  a  purely  exponential  query  of 
the  form  (*)  which  is  not  equivalent  to  a  linear  query  of  the  form  (**). 

Example  3.1  Consider  the  following  two  programs: 


f  S(XltXt,Xt)  S(W,*,,ia  S(XitU,V),  S(T,V,X,). 

P,:  { 

l  S(X,, *,,*,)  :-  A[X i,  *,,*»). 

(S(  *,,*„*,)  A(W,XJtU),  A[Xi,U,V),  S(T,V,X>). 

*(*„*„  X,). 

Let  A  =  {{6, 0,1),  (7, 1,2),  (6, 2, 3),  (8, 3, 4),  (7, 4, 5)}.  In  order  to  see  that  Pt  and  P,  are  not 
equivalent,  we  can  expand  both  programs.  We  observe  that  the  following  expression 

A( Wu  X„  Vt  )A(W,  Ut ,  V,M(T, ,  V, ,  U)A(Xi ,  U,  V)A(T,  V ,  X») 

can  be  obtained  by  first  applying  the  recursive  rule  of  P\  at  the  leftmost  occurrence  of  S  and  then 
replacing  all  occurrences  of  5  by  A.  Now,  by  assigning  (6,0,1),  (7,1,2),  (6,2,3),  (8,3,4)  and 
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(7.4.5)  »o  A(Wi,Xt,Vi),  A{WtVx,Vxl  A[TX,VUU),  A(X,,t/,V)  an d  A(T,V,X,)  respectively, 
we  generate  the  new  tuple  (8,0, 8)  ria  program  Px.  It  it  easy  to  tee  that  the  following  it  the 

t 

infinite  expansion  of  PX: 


et  =A(W,  Xit  U),  A(XU  V ,  V),  A(T ,  V,  X 

e,  =A(W,  Xt,  U),  A(X\,U,  V ),  A(WX ,  V,  Ux ),  A(T,  Ux ,  V,),  A(TX ,  VuXt), 
e,  =A(W,  X>,  U),  A(Xx,  V,  V ),  A(WX,V,  Ux),  A(T ,  Jflt  V,), 

A(Wlt  Vx,Ut),  A(TX,UX,  Vt),  A(Tt,  VJt  Xt ), 


We  note  that  the  first  two  literals  of  ei  is  a  prefix  of  ex  and  the  first  four  literals  of  es  is  a  prefix 
of  e»,  and  so  on.  We  observe  that  ei  and  ej  do  not  produce  the  tuple  (8, 0, 5).  Because  of 
the  way  the  variables  are  chained  in  ei  for  It  >  J,  the  first  five  literals  should  be  assigned  to 
(6, 0, 1),  (7, 1, 2),  (8, 2, 3),  (8, 3, 4)  and  (7, 4, 5)  respectively.  It  can  be  seen  that  the  tuple  (8, 0,  5) 
will  never  be  generated. 

In  the  database  setting,  in  addition  to  the  fact  that  function  symbols  are  not  allowed,  there  is 
another  restriction  which  is  known  as  the  su/e(y  rti/e[2j.  The  safety  requires  that  any  distinguished 
variable  should  also  occur  somewhere  in  the  body  of  the  rule.  Both  function  symbols  and  unsafe 
formulas  cause  nonterminating  computations^). 

Definition  3.1  Let  P(Xx,  Xx, . . .,  Xm)  and  Q(V'i,lj,...,y’m)  be  two  literals,  a  connection 
graph  fr->m  P  to  Q  is  a  directed  graph  on  m  nodes  for  which  there  is  an  edge  from  node  t  to  node 
j  iff  =  yj .  A  purely  exponential  query  is  uniformly  connected  iff  every  two  adjacent  literals  in 
the  body  of  the  rule  have  the  same  connection  graph. 

Example  3.2  Consider  the  following  program: 

S(Xl,X1,Xi)  S(U,Xt,X,)t  S(V,U,U),  S{Xj ,  V,  V). 

S{X i,Xt,Xt)  «(*,,*,,*,). 
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The  connection  graph  from  S(U,  Xt,  X%)  to  S(V,  U,U)  is  shown  in  Pig.  1. 

i 


Fig.  1.  The  connection  graph  from  S(U,  Xj,  Xt)  to  S(V,  U,U) 

Furthermore,  we  observe  that  the  connection  graph  from  S(V ,  U,  U)  to  S(Xi ,  V,  V)  is  also  the 
same  graph  in  Fig.  1.  Therefore,  this  program  is  uniformly  connected. 

Definition  3.2  Let  P  be  the  class  of  purely  exponential  programs  P  satisfying  the  following 
conditions: 

(1)  P  is  uniformly  connected  with  no  isolated  node(i.e.,  for  no  node  i, 
indegree(i)=outdegree(i)=0); 

(2)  Only  adjacent  literals  in  P  have  common  nondistinguished  variables; 

(3)  Every  distinguished  variable  Xt  occurring  at  position  i  of  the  head, 
can  only  occur  at  position  t  of  ail  literals(i.e.,  typed  distinguished  vari¬ 
ables). 

We  will  prove  that  every  program  in  P  can  be  written  in  the  form  (**).  It  should  be  noted 
that  P  is  a  huge  subclass  of  purely  exponential  queries,  in  particular  it  contains  all  binary  chained 
purely  exponential  queries  as  defined  in  [4). 

The  next  lemma  is  instrumental  in  proving  our  main  theorem. 

Lemma  3.1  Every  program  P  in  P  has  the  following  properties: 

(1)  For  no  node  i  in  the  connection  graph  of  P,  both  indegree(i)  and 
outdegree(i)  are  nonsero  unless  i  is  a  stationary  node(i.e.,  there  is  an 
edge  from  i  to  »* ). 

1  We  point  out  that  due  to  the  safety  rule,  stationary  nodes  are  labeled  by  distinguished 


variables. 


(2)  If  indegree(i)^  0,  then  the  variable  at  position  i  of  the  leftmost  literal 
of  P’s  body  most  be  distinguished.  Similarly,  if  outdegree(i)/  0,  then 
the  variable  at  position  t  of  the  rightmost  literal  of  P's  body  mast  be 
distinguished. 

Proof)  (1)  Let  i  be  a  nonstationary  node  with  nonsero  indegree  and  outdegree,  then  by  part(3) 
of  definition  3.2,  every  literal  of  P’s  body  must  have  a  nondistinguished  variable  at  position  i.  This 
implies  that  the  distinguished  variable  at  position  t  of  P’s  head  will  not  occur  in  P’s  body  which  is 
a  violation  of  the  safety  rule. 

(2)  Suppose  indegree(i)^  0.  If  node  i  is  stationary,  then  we  are  done  as  stationary  nodes 
are  labeled  by  distinguished  variables.  Therefore,  suppose  node  i  is  not  stationary,  in  which  case 
again  by  part  (3)  of  definition  3.2,  every  literal  of  P's  body  must  have  a  nondistinguished  variable 
at  position  t.  This  again  violates  the  safety  rule.  A  similar  argument  proves  the  case  for  which 
outdegree(i)^  0.  LI 

The  next  lemma  states  that  if  we  expand  a  program  P  of  P  ,  then  every  expression  in  the 
expansion  of  P  enjoys  the  properties  stated  in  lemma  3.1. 

Lemma  3.2  Let  e  be  an  expression  in  the  expansion  of  P  g  P  ,  then  e  is  uniformly  connected 
and  has  the  same  connection  graph  as  P.  Moreover,  both  properties  (1)  and  (2)  of  lemma  3.1  hold 
for  e. 

Proof:  By  induction  on  k,  where  k  is  the  number  of  applications  of  recursive  rule  of  P. 

Ba*i$  k  =  0.  Obvious  from  lemma  3.1. 

Inductive  Step:  Observe  that  for  every  application  of  the  recursive  rule,  we  increase  the  number 
of  the  literals  by  (n  -  1).  Suppose  that  e&  is  obtained  by  k  applications  of  the  recursive  rule: 

Zlti,  .  .  . ,  Z\<m),  .  .  . ,  ,  Z{  J»—  l  ),X ,  •  •  •  I  Z{p- 1  ),m)i  S(Zp,lt  ...  I  £jr,m)> 

S(*(,+»il  >  ^(p+ 1), It  •  •  •  i  ^(f+l),m)i  •  •  •  i  ^(^n+S(n-  l),li  £«+*(«- 1  ),2i  •  •  •  i  2»+4(n-l),m) 

Now,  if  we  expand  on  pth  occurrence  of  5,  we  have 
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«*+i  =S(zi,i >  zi,»>  •  •  •  i  zi,m), ....  S(Zir _!),* ,  2(,_ i),j.  ■  •  • , 

i 

W.i,  W,,lt ....  W1>m),  5(  Witll  W3,„ . . . ,  W3,n),  ....  S(WnA,  WB,„  . . . ,  Wn,m), 

■^(^(»+t),li  Z(r+l),l,  •  •  •  i  £(*+I),m)i  ■  •  •.  S[ZM+k(n-l),U  Zn+H»-i),»i  •  •  •» 


Obviously,  it  suffices  to  show  that  the  connection  graphs  from  S(Z(>_l)il, £(r_i),s, . . £(r-i),m) 
to  S(W|,j,  Wlt*, . . Wlim)  and  from  S(WBil,  W^.i,  •  •  •,  WBm)  to  5(£(p+i),i'^(r+i).s> . . 2(,+t),m) 
are  the  same  as  P'%  connection  graph.  Let  a  be  an  edge  from  node  i  to  node  j  in  the  connection 
graph  of  P,  then  by  definition  of  the  connected  graph  and  our  inductive 
hypothesis  Z(p-i),i  =  ZrJ-  Now,  when  we  replaced  the  literal  S(ZpA,  ZPy3, . . .,  Zp>m)  by 
S(Ww,W»,i,...)liri,m),S(W,f,tlirM . W,.m) - S(WnA,WnA . WB>m)  in  the  (fc  +  l)th  ap¬ 

plication  of  the  recursive  rule,  the  variables  ZpX,  ZPtt, . Zpm  are  distinguished.  Since  indegree(j)?£ 
0,  by  part  (2)  of  lemma  3.1,  Wtj  must  be  distinguished,  i.e.,  Zrj  —  W^j.  This  implies  that 
Z(p-ij)  =  Wt  j,  and  therefore  there  is  an  edge  from  node  i  to  node  j  in  the  connection  graph  from 
S{Zlr-i),i'Zb-i),*f  tZ(r-\),m)  to  S(WM,tVi,,....,lV1>m).  Furthermore,  the  stationary  nodes 
will  remain  stationary.  Finally,  as  we  rename  the  nondistinguished  variables  in  any  application  of 
the  recursive  rule,  we  will  not  create  any  edge  which  already  does  not  exist  in  the  connection  graph 


In  order  to  show  that  the  connection  graph  from  S(  WBij,  W*,  WB>m)  to 
S(z(r+i),t*  •£(*+!), ji  •  •  •  ,  %4i)>ffl)  is  the  same  as  P’s  connection  graph,  again  let  a  be  an  edge 
from  node  •  to  node  j.  We  will  show  that  Wn>i  =  2(*+i)j-  Since  the  assertion  holds  for  ek  by 
inductive  hypothesis,  the  definition  of  connection  graph  implies  that  Zp(  —  )j-  Also,  as  in 

the  above  case,  since  ZpA,  Zp>3, . . .,  Zp>m  are  distinguished  variables  for  the  (h  +  l)th  application 
and  outdegree(t)/  0,  by  part  (2)  of  lemma  3.1,  Wi  n  is  distinguished,  i.e.,  Wn  i  =  ZpA.  This  implies 
that  ZpA  =  Wn  i  —  Z(p+i)  i.  Hence,  there  is  an  edge  from  node  i  to  node  j  in  the  connection  graph 
from  S(  WBii ,  WB> j, . . . ,  tVnjm)  to  5(Z(p+i),t ,  Z(p+i)i3, . . . ,  £(p+i),m).  □ 

Theorem  3,1  Let  P  G  P  ,  then  P  is  equivalent  to  a  program  of  the  form  (**). 

Proof:  Let  P'  be  the  corresponding  program  of  the  form  (**),  we  will  show  that  P  is  equivalent 
to  P'.  Let  E  and  E1  be  the  expansions  of  P  and  P',  respectively.  Let  e  €  E  and  suppose  e  is 
obtained  by  k  applications  of  the  recursive  rule  in  P,  then  e  has  the  form: 


«e  * 


e  =  S(Z\'i,  Zl't,  .  .  . ,  Z  I,m), . . . ,  l).l.  •  •  •  t  2«+»(»-l),m) 


Let  e'  be  the  expression  obtained  from  It  applications  of  the  recursive  rule  in  P'  (observe  that  we 
can  only  expand  on  the  rightmost  literal),  then  e'  has  the  form: 


e'  =  S(W,,lf  Wtli, ....  Wt,m), . ...  S(Wn+k{n_lhl,  Wn+k(n_l)>2 . WB+k(._1)tm) 

Let  p  :  e  — *  e'  defined  by  p(Z{j)  -  W(j  for  j  =  1,2 .  m  and  s  =  1,2,...,  n  +  fc(n  -  1). 

We  will  show  that  p  is  a  containment  map. 

By  lemma  3.2,  both  e  and  e‘  have  the  same  connection  graph.  We  first  prove  that  p  is 
well-defined.  Suppose  Z{j  =  Zvj>,  then  we  need  to  show  that  Wjj  —  W<, j>.  In  case  Zij  is 
distinguished,  then  by  part  (3)  of  definition  3.2,  j  =  j'.  Now,  by  part  (1)  of  definition  3.2,  node 
j  most  be  stationary.  Hence,  Ztj  =  Zpj>  =  =  Wi<j>  =  X,  for  some  distinguished  variable 

X.  If  Zi j  is  nondistinguished,  then  by  part  (2)  of  definition  3.2,  Ztj  and  ■  j<  must  either  occur 
in  two  adjacent  literals  or  the  same  literal.  In  ease  they  occur  in  adjacent  literals,  by  definition  of 
the  connection  graph,  there  must  be  an  edge  from  node  j  to  node  j'.  Since  both  e  and  e'  have  the 
same  connection  graph,  then  Finally,  if  they  both  occur  in  the  same  literal,  again 

by  the  fact  that  e  and  e'  have  the  same  connection  graph  it  follows  that  Wij  =  j>. 

In  order  to  show  that  pis  a  containment  map,  we  observe  that  S(p{Zr,  t  ),p[Zp<2),  ...,p(Zr>m))  — 
S(Wfit ,  WFij, . . .,  Wr  m).  All  that  remains  to  be  shown  is  that  p  maps  distinguished  variables 
to  distinguished  variables.  Let  =  X  be  a  distinguished  variable.  If  node  j  is  stationary, 
then  X  occurs  at  position  j  of  all  literals  in  both  e  and  e'.  Hence,  p(Zij)  —  W,j  =  X.  If 
indegree(j)^  0,  then  by  lemma  3.2,  Zij  =  X  =  Wi,j  =  p(Z\j).  If  outdegree(y)^  0,  then  by  lemma 
3.2,  =  X  =  Wn+Un.tjj  =  p(Zn+l(n_ jjj ).  This  shows  that  every  expression  in  E  can 

be  mapped  to  an  expression  in  E' .  The  converse  is  trivial.  □ 

I.  Guessarian  [4]  has  shown  that  binary  chained  purely  exponential  queries  can  be  written  as 
linear  queries  by  using  a  very  elaborate  fixed  point  technique.  This  result  follows  immediately  from 
theorem  3.1. 
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Corollary  3.1  Binary  chained  purely  exponential  queries  can  be  written  as  linear  queries. 


4-  Progressively  Weaker  Chain  of  Template  Dependencies 


A  full  template  dependency TD)  is  a  formal  statement  r  of  the  form: 


where  for  «'  =  1, . . . ,  m,  Xi  =  for  some  1  <  j  <  n  and  1  <k  <  m. 

In  [3],  an  infinitely  weaker  and  stronger  sequence  v0,  rt,  tj,  ...  of  template  dependencies  is 
constructed  via  the  TD  graph.  These  sequences  have  been  used  to  establish  various  results  regarding 
TDs.  We  will  show  here  that  the  expansion  of  program  (/)  for  transitive  closure  of  a  relation  R 
will  provide  a  natural  example  of  an  infinitely  weaker  chain.  We  will  use  the  notation,  r  |=  <r,  to 
state  that  TD  «r  is  a  logical  consequence  of  r. 

Theorem  4.1  There  exists  an  infinite  sequence  of  full  TDs  ru,  n,  r7l  r*,  ...  such  that 
t=  T1+1  for  each  t,  i  =  1, 2, 3, . . .  and  no  two  r-jS  are  equivalent. 

Proof:  Fot  simplicity,  we  will  drop  the  quantifiers  from  the  TDs’  notation.  Let 

ri>>  ri.  tj,  r»,  ...  be  the  following  expressions  in  the  expansion  of  the  transitive  closure  of  R. 
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t*:  R(XtU)Ml(U,Y)->  R(X,Y). 

r ,  :  R(X,  lh)  A  R(Uu  V3)  A  R(Vt,  Y)  -  *(X,  Y). 

r>  :  R(X,  Ux)  A  R(UX,  Ut)  A  *([7,,  V,)  A  A(t/,,  tt.)  A  R(Ut,  Y)  -  *(X,y). 

:  R(X,  Ux)  A  *(17,,  U»)  A  *(17,,  C/,)  A  R(Ut,  Ut)  A  *(lf„  If.)  A  R(Ut,  V, ) 

A  R(Vt,  VT)  A  A(C/Tt  V,)  A  R(Ut,  Y )  -  R(X,  V). 


n  :  aR{Vuv3)a...a t^)  a  xft^.y)  -  *(X,y). 

n+1  :  R(X,Ul)AR(VuUt)A...AR(Uii+i„l,V3>+,)KR(Uii+,,Y)  -»  N(X,y). 


We  first  show  that  rx  )=  r3  (the  general  case  is  the  obvious  generalisation  of  this).  Let  r  satisfy 
rt,  and  suppose  that  (a,  ax),  (<M,  aj),  (a,,  a,),  (a,,®^,  (a4, 6)  ate  tuples  of  r  mapped  to  the  hypothesis 
tows  of  rj,  respectively.  We  want  to  show  that  (a,  6)  is  also  a  member  of  r.  Since  rt  holds  in  r, 
if  we  map  (a,ai),(at,a,),(a,,as)  to  (X,  U\)(Ux,U3)(Uj,Y)  respectively,  then  we  must  have  tuple 
(a,as)  in  r.  Now,  map  (a,aj),(a,,a4),(a4,6)  to  the  hypothesis  of  rt,  again  since  r,  holds  in  r,  we 
must  have  (a,  b)  in  r. 

To  show  that  n  rJ+1 ,  let 

*■  =  {(x.^i),  Vi, u3), ....  uv+,),  (c/,.+.,y),  (x,y)} 

i.e.,  a  relation  consisting  of  all  tj+1’s  rows.  If  we  map  (X,  lh),  (f/lt  f/j),  . . . ,  (f/,*,  I/J<+I)  to  the 
hypothesis  rows  of  rjt  then  for  r  to  satisfy  n,  tuple  (X,  f7j.  +  1 )  must  be  in  r,  which  is  not  according 
to  onr  construction  of  r.  □ 


5.  Conclusion 


13 


We  hsve  identified  n  sufficient  condition  to  rewrite  purely  exponential  queries  as  linear  queries. 
Consequently,  as  a  corollary,  we  have  obtained  a  new  proof  for  the  result  of  Guessarian  [4].  In 
addition,  a  natural  infinite  chain  of  progressively  weaker  TDs  is  constructed  via  expansion  of  the 
logic  program  for  transitive  closure  of  a  relation  R.  We  hope  the  techniques  developed  in  this  paper 
motivate  further  research  in  this  area. 
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